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IMAGE PROCESSOR 
This application is based on application No. 2000-93962 filed in 
Japan, the contents of which are hereby incorporated by reference. 

5 BACKGROUND OF THE INVENTION 
FIELD OF THE INVENTION 

The present invention relates to image processing on a document 
including different types of images. 

10 DESCRIPTION OF PRIOR ART 

A document may include different types of images such as 
characters, line images, pictures and characters not recognized. It is known to 
process different types of images separately and to compose them to integrate 
a document image. For example, bit map images are generated from character 

15 codes, from characters not recognized and from non-character images and they 
are composed (Japanese patent laid open publication 9-91371/1997). In 
another way, in order to use rounded characters, character images are modified 
without using font of the rounded characters, and they are composed with the 
other image (Japanese patent laid open publication 5-134651/1993). In a 

20 different way, when an image data is converted to another image data of a 
different format, regions of characters, line images and pictures are converted in 
different forms at the same time, and the image is reconstructed (Japanese 
patent laid open publication 5-20495/1993). 

In an image processing system, different types of regions are 

25 separated from bit map data obtained by scanning a document. Character 
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regions are converted to code data with optical character recognition, line image 
regions are converted to vector data, and the other regions are processed as bit 
map data. Then, these data are composed and the resultant integrated image 
is outputted. However, the images may not be composed fitly in the integrated 
5 image. For example, when vector data are generated, the precision of the 
position of the edited image may not be correct sufficiently. Therefore, when 
vector data are composed with the original image, the original image is 
superposed around the vector data, so that a blurred image may be outputted. 
For example, in an image wherein colors are different at both sides of a straight 
10 line, the image around the straight line may be blurred, or in an image including 
a bit map image enclosed with vector data, the boundary thereof may be blurred. 

SUMMARY OF THE INVENTION 

An object of the present invention is to provide an apparatus and 
1 5 method for composing pictures of different types fitly. 

An image processor according to the invention comprises a first 
converter which extracts a line image region in input bit map image data and 
converts the line image to vector data, a second converter which converts bit 
map data of pixels in the input bit map image data around the line image of the 
20 line image region based on the bit map data of pixels around the line image 
region, and a composer which composes the vector data of the line image 
obtained by said first converter and the bit map data converted by said second 
converter. 

An advantage of the present invention is that a bit map image 
25 including a line image can be reproduced without position shift in an image 
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vector data. 

RRIEF DESCRIPTION OF THE DRAWINGS 

These ana other o bj ects - — o, the presen, invention - 
h me d. .he following description taKen in function with me 
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processor; 
processor; 
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Fig . 3 is a diagram for explaining me image data processing; 
Fig 4 is a flowchart of image synthesis, 
Fig 5 is a diagram o, an exampie of an input bit map image; 
Fig . 6 is a diagram of an exampie of a bi-ievel image; 
Fig . 7 is a diagram of an example of vector da.a; 

Fig 8 is a diagram of position detection; 

Fig 9 is a diagram for explaining a position for embedding; 



embedding; 



Fig 1 1 is a diagram of an example of a composed image; and 



scanner. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

d esigna,e like or correspond par* -oughou, «" — *~ 
shows an image processor ivj „ OTXA/n rk 300 

- bv a scanner 200 and sends print da* of the image data v,a a network 300 

: , map image da, are processed differently. - image «0 

interface 102 a binarizer 104, a character 
has components such as a scanner interface 102, 

.cognition de.ce 10S, a vector converter 10, a bit map proctor 11* 

components will be explained later. 

The image processing in me components 104 to 114 may 

m another way, it may be performed 
performed by dedicated hardware crcuits. in another 

by a software program o, image processing by a computer such as a persona 

work station which uses an image processing software program 
computer or a work station, wh ^ 
„ example as shown in Fig. 2, a wu i^ 

lika a random access memory 124 usea 
. or the like for storing a program or the like, a random 

) or me means, and a 

as a wo* area, and a keyboard 126 and a mouse 128 as inpu 

- • 130 The CPU 120 is also connected to storage devices such as 
display device 130. The CPU ^ 
a flexible disk drive 132 for accessing a flexible d,sk 132, 

> -rn ROM drive 136 for accessing a 
for accessing a hard disk (no, show.) and a CD-ROM 

> 5 CD-ROM 137. A program for image processing may be stored 
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recording media. The CPU 120 is also connected to an input/output device 138 
(used as the scanner interface) and a communication device 140 (used as the 
output interface). The image processing program stored in one of recording 
media is read and written to the RAM 124, and the CPU 120 executes the 
5 image processing program in the RAM 124. This modified example is not 
explained further below. 

Image data are processed by the image processor 100 as follows. 
Fig. 3 is provided to understand the image processing explained below. First, 
character regions are processed to recognize characters as follows. The 

10 binarizer 104 binarizes the input color bit map image data and sends the bi-level 
image to the character recognizer 106. The character recognizer 106 extracts 
character regions from the input bi-level image data, performs optical character 
recognition on the character regions and stores the obtained character data 
including character codes, region position data and character attribute. Then, 

15 the character recognizer 106 sends to the vector converter 108 the image data 
on the character regions obtained by deleting and approximating the image data, 
as well as the input color bit map image data. Next, line image regions are 
processed. In a vector converter 108, line image regions (or regions 
approximated as line images) are extracted from the bi-level image data and 

20 are converted and stored to vector data including start point, end point, vector 
type, line width and line attribute. On the other hand, the color bit map image 
data in the line image regions is approximated, and the resultant color bit map 
image data are sent to the bit map processor 110. The approximation will be 
explained later. Finally, the color bit map image data are processed. The bit 

25 map processor 110 divides the data into regions and extracts picture regions. 
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Then, for each of the extracted picture regions, the region is discriminated and 
the data is approximated, and the obtained bit map data including position of 
the picture region and color bit map image data are stored on the extracted 
picture regions. 

5 In the composer 112, the magnifying factor of each image is 

changed, and a bit map image of characters obtained from the character data, a 
bit map image obtained from the vector data and a picture image obtained from 
the bit map data are composed. Then, the format converter 114 converts the 
composed image to a predetermined format and outputs the result via an output 
1 0 interface 1 1 4 to the printer 400. 

In the above-mentioned image processing, the color bit map 
image data obtained by the scanner 200 are divided into regions. In the 
character regions, character data are determined with optical character 
recognition. Then, vector data are obtained on line image regions. In the other 
15 regions, bit map data are obtained. Then, the three kinds of data are composed 
and outputted. In the processing, a line image having background is converted 
to vector data by the vector converter 108. On the other hand, as to the 
background of the line image approximated as vector data and regions 
thereabout, the original pixel data of the background are not adopted as the bit 
20 map image data of the background, while the approximation is performed based 
on data of pixels around the line image. For example, as to a line image of 
/color bit map image data and pixel data of the regions around the line image, bit 
\ map image is approximated by the pixels at the regions around the line image, 
J and the line image defined by the vector data is composed therewith. Therefore, 
25 \ position shift or mismatching between background and vector data becomes 
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unnoticeable, irrespective of the precision of approximated position. On the 
other hand, the color of background is likely changed at the boundary line of a 
straight line, an arc, an enclosed image or the like. Then, the pixel data to be 
approximated is selected according to a side relative to the approximated line 
5 where the pixel to be deleted exists. 

Next, the image composition of various types of images is 
explained according to the flowchart shown in Fig. 4 of the image processor 100. 
The processing of a straight line is explained as a type of line image. First, the 
RGB color bit map image data is received from the scanner 200, and the input 

10 image data is subjected to binarization for vector conversion to generate bi-level 
image data while keeping the color bit map image data (S10). Fig. 5 shows an 
image scanned on a document having different colors of background at two 
sides of a straight line, or a boundary, as an example of the input bit map image. 
Fig. 6 shows bi-level image at the boundary shown in Fig. 6. If a character 

15 region exists, though not shown in the drawings, the above-mentioned optical 
character recognition is performed on the bi-level image in the character region 
to determine character data (S12). 

Next, line image is processed. Candidate pixels to be 
approximated as a straight line are extracted (S14), and they are approximated 

20 as a straight line (S16). Fig. 7 shows an example of vector data obtained from 
the bi-level image shown in Fig. 6, that is, line width, Iw, starting point (bx, by), 
and end point (ex, ey) of a straight line, and width ew, bw of the approximation 
region at the right and left sides of the straight line. The approximation of 
straight line may be performed by known technique such as Hough conversion 

25 or least square method. 
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A table (Table 1 ) is generated on the vector data of the start point, 
the end point, the line widths and the region approximated based on the bi-level 
image. In the image shown above, only one straight line is included. If a 
plurality of straight lines exist, the table includes vector data of the plurality of 
5 obtained straight lines. 



Table 1 Vector data table 



Straight 
line 


Start point 


End point 


Line 
width 


Approximation 
region 


bx 


by 


ex 


ey 


Iw 


bw 


ew 


1 


10 


50 


10 


150 


5 


10 


10 


2 


60 


50 


60 


150 


5 


10 


10 


3 


110 


50 


110 


150 


5 


10 


10 


4 


160 


50 


160 


150 


5 


10 


10 


















































N 


bx[N] 


by[N] 


ex[N] 


ey[N] 


lw[N] 


bw[N] 


ew[N] 



10 Next, the number of straight lines is set to N (S18), and each of 

the N lines is processed as follows. For the bit map image data, P is set to the 
number of pixels in an approximation region approximated as a straight line in 
the bit map image data (S20). Then, for each of the pixels in the approximation 
region, the pixel position, that is, a side relative to the straight line at which the 



9 



object pixel exists, is detected (S22). Fig. 8 shows an example of detection of 
pixel position. For example, as shown in Fig. 9, straight lines approximated are 
represented as functions f(x), g(y), the position of pixel (c, d) is at the minus 
side if the signs of f(c)-d, g(d)-c are all positive and at the plus side if the signs 
5 are all negative. Next, the pixel value is selected according to the detected 
pixel position (S24) and is embedded at the object pixel at the above position in 
the bit map image (S26). At this step, a pixel to be approximated is selected 
according a side relative to the approximated line where the object pixel exists. 
For example, in order to determine the value to be embedded for pixel (c, d), a 

10 straight line h(c) is determined to extend through the pixel (c, d) perpendicularly 
to the approximated line. If the pixel position detected at step S20 is the minus 
direction, a boundary straight line f1(x) of the approximation region at the minus 
direction is determined, and as to a pixel distant by predetermined value n from 
the crossing point of f1 (x) and h(x) in the minus direction of h(x), the value of the 

15 pixel is set to the value for embedding. Then, P is decremented (S28), and the 
flow returns to step S22. The above processing is repeated for all the pixels to 
determine the values to be embedded. Fig. 10 shows an example of a bit map 
image after the embedding. It is apparent that position shift between the 
approximated line and the background color does not appear. Next, N is 

20 decremented (S30), and the flow returns to step S20. Thus, the above 
processing is repeated for all the straight lines. It is to be noted that the 
embedding is performed similarly for the pixels at which the straight line exists. 

Next, the above-mentioned bit map data are obtained on the 
regions other than the character and line image regions (S32). Finally, the bit 

25 map data, vector data and character data are composed to form an integrated 
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image (S34). Then, it is converted to a general format which can deal with bit 
map image, vector image and character image, and the converted image is 
outputted. By approximating the color bit map image data as explained above, 
the bit map image fits with the position of an approximated line of the line image 
5 at boundaries of line images, and color shift around the boundaries is prevented. 
Fig. 1 1 shows an example of a composed image. By composing the bit map 
image approximated in Fig. 10 with the vector image of a straight line defined by 
vector data, mismatching between the straight line and the background is 
prevented, as shown in Fig. 11. 

10 In the above example, approximation as straight line is explained 

on a line image region. However, approximation as an arc, Bezier curve or the 
like can be processed similarly. 

Further, the approximation is applied to a closed region filled with 
a color, and position mismatching can be prevented. 

15 Fig. 12 shows a modified system for the above-mentioned image 

processing. In this system, a scanner 200' has an image reader section 102' 
where a document is scanned to obtain output bit map data of the document. 
Further, the scanner 200' includes components such as a binarizer 104', a 
character recognition device 106', a vector converter 108', a bit map processor 

20 110', a composer 112', a format converter 114' and an output interface 116'. 
The bit map data obtained by the image reader section 102' is sent to the 
binarizer 104'. The components 104' to 114' are similar to the counterparts 
shown in Fig. 1, and they are not explained here again. 

As explained above, line image data represented as vector data 

25 are composed with bit map images approximated around the line image regions. 
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Then, an image having different colors at both sides of a straight line or the like, 
or an image around a boundary between an image of enclosed line and an bit 
map image in the enclosed line can be reproduced, without position shift. 

By converting character regions to character codes and line image 
regions to vector data, while keeping bit map data in the other regions, the 
memory capacity for image data is decreased, and processing of the image 
data with a computer becomes easier. Further, because the line images can be 
reproduced as raster data by raster image processing in a print controller, 
characters and line images would not be affected by noises on reading or by 
resolution. Then, image quality of a copy is as good as that of an original print. 

Although the present invention has been fully described in 
connection with the preferred embodiments thereof with reference to the 
accompanying drawings, it is to be noted that various changes and 
modifications are apparent to those skilled in the art. Such changes and 
modifications are to be understood as included within the scope of the present 
invention as defined by the appended claims unless they depart therefrom. 



